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E0918/ 95-320 

FLOW CONTROL ARRANGEMENT IN A NETWORK 
SWITCH BASED ON PRIORITY TRAFFIC 

BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION 

The present invention relates to layer 2 switching of data packets in a non-blocking network 
switch configured for switching data packets between subnetworks. 

BACKGROUND ART 

5 Local area networks use a network cable or other media to link stations on the network. Each 

local area network architecture uses a media access control (MAC) enabling network interface devices 
at each network node to access the network medium. 

The Ethernet protocol IEEE 802.3 has evolved to specify a half-duplex media access 
mechanism and a full-duplex media access mechanism for transmission of data packets. The full- 

1 0 duplex media access mechanism provides a two-way, point-to-point communication link between two 
network elements, for example between a network node and a switched hub. 

Switched local area networks are encountering increasing demands for higher speed 
connectivity, more flexible switching performance, and the ability to accommodate more complex 
network architectures. For example, commonly-assigned U.S. Patent No. 5,953,335 discloses a 

15 network switch configured for switching layer 2 type Ethernet (IEEE 802.3) data packets between 
different network nodes; a received data packet may include a VLAN (virtual LAN) tagged frame 
according to IEEE 802. Id protocol that enables the network switch to perform more advanced 
switching operations. For example, the VLAN tag may specify another subnetwork (via a router) or a 
prescribed group of stations. 

20 Flow control has been proposed to reduce network congestion in a network switch, where a 

sending station temporarily suspends transmission of data packets. For example, commonly-assigned, 
U.S. Patent Nos. 5,673,254, 5,859,837 and 5,905,870 disclose arrangements for initiating and 
maintaining flow control in shared-medium, full-duplex, and switched IEEE 802.3 networks by 
generation of a PAUSE frame in a full-duplex network, or backpressure in a half duplex network. 

25 Hence, typical layer 2 flow control involves a layer 2 switch outputting pause commands to all devices 
connected to the network switch ports when internal switch resources are close to being exhausted. 

Efforts to enhance the switching performance of a network switch to include layer 3 (e.g., 
Internet protocol) processing may suffer serious drawbacks. In particular, the generation of pause 
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frames on all network switch ports adversely affects attempts to maintain advanced switching 
operations, for example maintaining high priority traffic. Hence, the generation of pause frames may 
interfere with network nodes attempting to transmit high priority traffic, resulting in a conflict between 
the necessity for congestion control and the necessity for maintaining reliable transport for high 
5 priority traffic. 

SUMMARY OF THE INVENTION 

There is a need for an arrangement that enables a network switch to perform priority based 
switching of data packets without congestion of the network switch resources. 
1 0 There is also a need for an arrangement in a network switch that enables the network switch to 

avoid congestion by outputting flow control frames without interfering with the switching of high 
priority traffic. 

These and other needs are attained by the present invention, where a network switch outputs 
flow control frames on selected network switch ports based on a determined priority of network traffic 

15 on the selected network switch ports. 

One aspect of the present invention provides a method in a network switch, the method 
comprising first determining a priority for a data frame received on a network switch port, second 
determining a depletion of network switch resources, and selectively outputting a flow control frame 
on the network switch port based on the detennined depletion of network switch resources relative to 

20 the determined priority. The determination of a priority for the data frame enables the network switch 
to provide advanced switching operations for the data frame, ensuring that quality of service may be 
provided for the received data packet. Moreover, the selective output of a flow control frame based on 
the determined depletion of network switch resources relative to the determined priority ensures that 
flow control frames are not output on network switch ports receiving high priority data frames, or at 

25 least until the network switch resources reach a substantially depleted condition. Hence, the network 
switch can maintain the reliability of traffic prioritization and quality of service while managing 
network switch resources. 

Another aspect of the present invention provides an integrated network switch that includes a 
plurality of network switch ports, each configured for receiving a data packet and selectively 

30 outputting a flow control frame in response to a flow control output signal. The switch also includes a 
flow control module configured for determining a depletion of network switch resources, the flow 
control module configured for storing, for each of the network switch ports, a corresponding 
determined priority value based on the corresponding received data packet, the flow control module 
selectively outputting the flow control output signal to selected ones of the network switch ports based 
35 on the determined depletion of network switch resources relative to the respective determined priority 
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values. The selection of the network switch ports to receive the flow control output signal enables the 
network switch to selectively output flow control frames, on a per port basis, based on the 
corresponding priority of the network switch port traffic, and based on the determined depletion of 
network switch resources. Hence, the network switch can output flow control frames on network 
5 switch ports receiving low priority traffic to relieve congestion, while maintaining the high priority 
traffic on selected network switch ports. 

Additional advantages and novel features of the invention will be set forth in part in the 
description which follows and in part will become apparent to those skilled in the art upon examination 
of the following or may be learned by practice of the invention. The advantages of the present 
10 invention may be realized and attained by means of instrumentalities and combinations particularly 
pointed in the appended claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Reference is made to the attached drawings, wherein elements having the same reference 
numeral designations represent like element elements throughout and wherein: 
1 5 Figure 1 is a block diagram of a packet switched network including multiple network switches 

for switching data packets between respective subnetworks according to an embodiment of the present 
invention. 

Figure 2 is a diagram illustrating in further detail the flow control module of Figure 1 . 
Figure 3 is a diagram illustrating the method of outputting flow control frames on selected 
20 network switch ports according to an embodiment of the present invention. 

BEST MODE FOR CARRYING OUT THE INVENTION 

Figure 1 is a block diagram illustrating a packet switched network 10, such as an Ethernet (IEEE 
802.3) network. The packet switched network includes integrated (i.e., single chip) multiport switches 12 

25 that enable communication of data packets between network stations 14. Each network station 14, for 
example a client workstation, is typically configured for sending and receiving data packets at 1 0 Mbps or 
100 Mbps according to IEEE 802.3 protocol. Each of the integrated multiport switches 12 is 
interconnected by gigabit Ethernet links 16, enabling transfer of data packets between subnetworks 18a, 
18b, and 18c. Hence, each subnetwork includes a switch 12, and an associated group of network stations 

30 14. 

Each switch 12 includes a switch port 20 that includes a media access control (MAC) module 22 
that transmits and receives data packets to the associated network stations 14 across 10/100 Mbps physical 
layer (PHY) transceivers (not shown) according to IEEE 802.3u protocol, and port filters 24. Each port 
filter 24 is configured for identifying a user-selected attribute of the data frame, for example a priority of 
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the data packet, and outputting the relevant switching information (e.g., whether the user-selected attribute 
was detected) to a switch fabric 25. The switch fabric 25 is configured for making frame forwarding 
decisions for received data packets. In particular, the switch fabric 25 is configured for layer 2 switching 
decisions based on MAC source address, MAC destination address, and VLAN information within the 
5 Ethernet (IEEE 802.3) header; the switch fabric 25 is also configured for implementation of user-defined 
switching policies based on detection of the user-selected attributes within the Ethernet packet, described 
in detail below. 

As shown in Figure 1 , each switch 12 has an associated host CPU 26 and a buffer memory 28, 
for example an SSRAM. The host CPU 26 controls the overall operations of the corresponding switch 

10 12, including programming of the port filters 24 and the switch fabric 25. The buffer memory 28 is 
used by the corresponding switch 12 to store data frames while the switch fabric 25 is processing 
forwarding decisions for the received data packets. 

The switch fabric 25 is configured for performing layer 2 switching decisions and switching 
decisions mat implement user-defined switching policies; such user-defined switching policies may 

15 include granting sufficient switch resources to ensure a guaranteed quality of service (e.g., reserved 
bandwidth or guaranteed latency) for a received data packet having a prescribed user-selected attribute, for 
example having been received on a selected ingress port, having a prescribed MAC or IP source or 
destination address, or having been determined to belong to a prescribed flow, for example an IGMP 
media flow, or having a determined priority. The availability of switching decisions that implement user- 

20 defined switching policies including layer 3 switching decisions and above, also referred to herein as 
policy-based switching decisions, enables the switch fabric 25 to make intelligent decisions as far as how 
to handle a packet, including advanced forwarding decisions, and whether a packet should be considered a 
high-priority packet for latency-sensitive applications, such as video or voice. 

Instances may arise where the network traffic flowing through the network switch 12 may begin 

25 to create a substantial depletion of network switch resources such as buffer capacity, creating congestion 
in the network switch. If flow control frames, for example backpressure in a half-duplex network or pause 
frames in a full duplex network, were transmitted from all the network switch ports 20 to the network 
stations 14 according to IEEE 802.3 protocol, the flow control frames may interfere with the policy-based 
switching decisions for high-priority network traffic. Such interference with high priority traffic is 

30 undesirable especially if the congestion can be alleviated merely by temporarily suspending the low 
priority traffic. 

According to the disclosed embodiment, the network switch 12 includes a flow control module 30 
that determines which of the network switch ports 20 should output a flow control frame based on the 
determined depletion of network switch resources and based on the corresponding priority value of the 
35 network traffic on each network switch port. Hence, any network switch port 20 that receives high 
priority traffic does not output a pause frame to the corresponding network station 14, enabling that 
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network station 14 to continue transmission of the high priority traffic. In most cases, the congestion and 
depletion of network switch resources can be alleviated by sending flow control frames on only those 
network switch ports that receive lower priority traffic, enabling the network switch 12 to reduce 
congestion without interfering with high priority traffic. 
5 As described below, the determination of whether a data packet is designated as high priority 

traffic is programmable by a user. Although the determination of a priority for the data frame received on 
a given network switch port 24 could be determined centrally by the flow module 30, the disclosed 
embodiment performs distributed packet classification in each of the network switch ports 20. In 
particular, each port filter 24 of Figure 1 is configured for identifying user-selected attributes from a 
10 received layer 2 frame. The port filter 24 can be implemented as a state machine that monitors the bytes 
coming in from the network, hence the state machine can analyze the data frame for the presence of 
prescribed user-selected attributes (such as the priority of a packet) on a per-byte basis as the bytes of 
packet data of the data frame are received by the network switch port. In addition, the port filter 24 can be 
configured for multiple simultaneous comparisons of the incoming packet data with multiple templates 
1 5 that specify respective user-selected attributes, enabling the port filter 24 to simultaneously determine the 
presence of a plurality of user-selected attributes as the data frame is received. 

Figure 2 is a diagram illustrating in further detail the flow control module 30 according to an 
embodiment of the present invention. Figure 3 is a diagram illustrating the method of selectively 
outputting flow control frames on selected network switch ports according to an embodiment of the 
20 present invention. The flow control module 30 includes a flow control generator 32 having an aging 
resource 34, a resource monitor 36, a port table 40, and a threshold-priority table 42. The port table 40 is 
configured for storing the determined priority values for the respective network switch ports 20, received 
by the flow control generator 32 from the network switch ports 20. For example, assume the network 
switch 12 includes network switch ports designated A, B, C, D, E, and F; the flow control generator 32, in 
25 response to receiving the port priority values from the respective switch ports 20, stores the port priority 
values in the port table 40. According to the example illustrated in Figure 2, the network switch ports A, 
D and F have zero priority traffic, network switch port B has level 1 priority traffic, network switch port C 
has level 3 priority traffic, and network switch port E has level 5 priority traffic. 

The flow control generator 32 is configured for selectively outputting flow control output signals 
30 to selected network switch ports based on the determined depletion of network switch resources relative to 
the respective determined priority values. For example, the resource monitor 36 is configured for 
monitoring a free buffer queue 50, within the network switch 12, that stores unused frame pointers that 
specify a corresponding buffer memory location in the memory 28 available for storage of frame data. 
Hence, if there is no frame data stored in the memory 28, then the free buffer queue is filled at 100 
3 5 percent; conversely, congestion and depletion of network switch resources (e.g., available buffer storage in 
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the buffer memory 28) begin to occur as the number of frame pointers in the free buffer queue 50 
approaches zero. 

The threshold priority table 42, programmed by a user of the host CPU 26 in step 60, is 
configured for storing prescribed resource threshold values (e.g., Tl, T2, and T3) and respective user- 
5 defined priority thresholds (e.g., "2", "4" and "6") that specify the minimum port priority value necessary 
to avoid generation of a flow control frame on the corresponding network switch port 20. After the 
threshold priority table 42 has been programmed by the user, the network switch 12 can begin switching 
operations. 

As described above, each of the network switch ports 20 includes a port filter 24 configured for 

1 0 determining a priority for a data frame received on the corresponding network switch port. Each port filter 
24, upon determining the priority for the data frame, forwards the determined port priority value to the 
flow control generator 32 for storage in the port table 40 in step 62. 

The resource monitor 36 supplies to the flow control generator 32 the number of frame pointers in 
the free buffer queue 50, indicating the availability of network switch resources. The flow control 

1 5 generator 32, upon determining the number of free buffer pointers in the free buffer queue 50, determines 
in step 64 whether there is a depletion of network switch resources based on the resource thresholds Tl, 
T2, and T3 stored in the threshold priority table 42; if a depletion of network switch resources is detected, 
the flow control generator 32 compares each of the port priority values stored in the port table 40 in step 
66 with the corresponding user-defined priority threshold in the threshold priority table 42 to determine 

20 whether a flow control output signals should be output. For example, if the flow control generator 32 
determines that the number of frame pointers in the free buffer queue 50 falls below the threshold Tl, T2, 
or T3, the flow control generator 32 determines which network switch ports 20 have a port priority value 
less than "2", "4", or "6", respectively. Assuming that the network switch resources fell below the 
threshold T2, the flow control generator 32 would output a flow control output signal in step 68 to the 

25 network switch ports A, B, C, D, and F, causing those network switch ports to output flow control frames 
(e.g., backpressure or pause frames); however the network switch port E, having a port priority value 
greater than the user-defined priority threshold value of "4", does not receive the flow control output 
signal, hence can continue to receive the level 5 high-priority traffic. 

The flow control generator 32 also includes an aging resource 34, enabling the flow control 

3 0 generator 32 to delete aged port priority values in step 70. 

According to the disclosed embodiment, a flow control module selectively determines whether a 
network switch port should output a flow control frame based on the determined depletion of network 
switch resources relative to a determined priority of data traffic received by the network switch port. 
Hence, high-priority traffic can be maintained to satisfy quality of service requirements, while reducing 

3 5 congestion by temporarily suspending lower priority traffic. 
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While this invention has been described with what is presently considered to be the most 
practical preferred embodiment, it is to be understood that the invention is not limited to the disclosed 
embodiments, but, on the contrary, is intended to cover various modifications and equivalent 
arrangements included within the spirit and scope of the appended claims. 



What is Claimed Is: 



1 . A method in a network switch, the method comprising: 

first determining a priority for a data frame received on a network switch port; 
second determining a depletion of network switch resources; and 

selectively outputting a flow control frame on the network switch port based on the determined 
depletion of network switch resources relative to the determined priority. 

2. The method of claim 1, wherein the first determining step includes determining the 
priority for the data frame at the network switch port. 

3. The method of claim 1, further comprising storing the determined priority within a 
table configured for storing the determined priority for each of a plurality of the network switch ports. 

4. The method of claim 3, wherein the second determining step includes determining 
whether an availability of the network switch resources falls below a first prescribed threshold value. 

5. The method of claim 4, further comprising setting the first prescribed threshold value 
based on a user-defined priority threshold. 

6. The method of claim 5, wherein the setting step includes setting a plurality of 
prescribed threshold values, including the first prescribed threshold value, based on a plurality of the 
user-defined priority threshold, respectively. 

7. The method of claim 6, wherein: 

the first determining step includes determining the priority from a plurality of available 
priority values; 

the second determining step includes detenrrining whether the availability of the network 
resources has fallen below an identified one of the prescribed threshold values; and 

the selectively outputting step includes identifying from the table the network switch ports 
having respective priority values less than the corresponding user-defined priority threshold for the 
identified one prescribed threshold value. 
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8. The method of claim 6, wherein the step of setting the plurality of prescribed threshold 
values includes storing the prescribed threshold values and the respective user-defined priority 
thresholds in a second table. 

9. The method of claim 3, further comprising deleting the determined priority from the 
table after a prescribed aging interval. 

10. The method of claim 3, further comprising setting a plurality of prescribed threshold 
values based on a plurality of respective user-defined priority thresholds. 

1 1 . The method of claim 1 0, wherein: 

the first determining step includes determining the priority from a plurality of available 
priority values; 

the second determining step includes determining whether the availability of the network 
5 resources has fallen below an identified one of the prescribed threshold values; and 

the selectively outputting step includes identifying from the table the network switch ports 
having respective priority values less than the corresponding user-defined priority threshold for the 
identified one prescribed threshold value. 

12. An integrated network switch comprising: 

a plurality of network switch ports, each configured for receiving a data packet and selectively 
outputting a flow control frame in response to a flow control output signal; and 

a flow control module configured for determining a depletion of network switch resources, the 
5 flow control module configured for storing, for each of the network switch ports, a corresponding 
determined priority value based on the corresponding received data packet, the flow control module 
selectively outputting the flow control output signal to selected ones of the network switch ports based 
on the determined depletion of network switch resources relative to the respective determined priority 
values. 

13. The switch of claim 12, wherein each network switch port includes a port filter 
configured for determining the determined priority value for the corresponding data packet. 

14. The switch of claim 13, wherein the flow control module includes a first table 
configured for storing the determined priority values for the respective network switch ports, and a 
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second table configured for storing a plurality of prescribed resource threshold values and respective 
user-defined priority thresholds, the flow control module configured for determining whether the 
5 availability of the network resources has fallen below an identified one of the prescribed resource 
threshold values. 

15. The switch of claim 14, wherein the flow control module is configured for selecting 
the selected ones of the network switch ports based on the respective determined priority values being 
less than the corresponding user-defined priority threshold for the identified one prescribed resource 
threshold value. 

16. The switch of claim 14, further comprising a free buffer queue configured for storing 
unused frame pointers, each unused frame pointer specifying a corresponding buffer memory location 
available for storage of frame data, the flow control module configured for determining the depletion 
of network switch resources based on a comparison between a number of the unused frame pointers in 

5 the free buffer queue relative to the prescribed resource threshold values. 

17. The switch of claim 12, wherein the flow control module deletes the determined 
priority value for a selected one of the network switch ports after a prescribed aging interval. 
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E0909/ 95-311 



ARRANGEMENT IN A NETWORK SWITCH FOR 
PRIORITIZING DATA FRAMES BASED ON USER- 
DEFINED FRAME ATTRIBUTES 



ABSTRACT OF THE DISCLOSURE 

A network switch includes network switch ports, each including a port filter configured for 
detecting user-selected attributes from a received layer 2 frame. Each port filter, upon detecting a 
user-selected attribute in a received layer 2 frame, sends a signal to a switching module indicating the 
determined presence of the user-selected attribute, for example whether the data packet has a 
5 prescribed priority value. The network switch includes a flow control module that determines which of 
the network switch ports should output a flow control frame based on the determined depletion of 
network switch resources and based on the corresponding priority value of the network traffic on each 
network switch port. Hence, any network switch port that receives high priority traffic does not output a 
flow control frame to the corresponding network station, enabling that network station to continue 
10 transmission of the high priority traffic. In most cases, the congestion and depletion of network switch 
resources can be alleviated by sending flow control frames on only those network switch ports that receive 
lower priority traffic, enabling the network switch to reduce congestion without interfering with high 
priority traffic. 
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